10 research outputs found

    Question Generation for French: Collating Parsers and Paraphrasing Questions

    This article describes a question generation system for French. The transformation of declarative sentences into questions relies on two different syntactic parsers and named entity recognition tools. This makes it possible to further diversify the questions generated and to possibly alleviate the problems inherent to the analysis tools. The system also generates reformulations for the questions based on variations in the question words, inducing answers with different granularities, and nominalisations of action verbs. We evaluate the questions generated for sentences extracted from two different corpora: a corpus of newspaper articles used for the CLEF Question Answering evaluation campaign and a corpus of simplified online encyclopedia articles. The evaluation shows that the system is able to generate a majority of good- and medium-quality questions. We also present an original evaluation of the question generation system using the question analysis module of a question answering system.

    Covid-19 et communication de crise. Focus linguistique sur les tweets francophones de Belgique

    The Covid-19 health crisis of 2020 strongly affected the international community. Especially during the lockdown period, social media were widely used for information and emotion sharing. This article aims to keep a material trace of these crisis communication trends. To this end, we applied quantitative and qualitative methods to a corpus of 100,000 tweets we collected in the French-speaking part of Belgium. The corpus is divided into three sub-groups: citizens, politicians and the media. We first present the corpus collection and the methodology we followed. We also look at the lexical creativity that resulted from the crisis and the lockdown situation. We then propose a semantic approach to the themes that emerged from the crisis tweets and that highlight citizens’ concerns. Finally, we depict the personalities related to the crisis, focusing on their communication and on the image they portray to the public.

    Collection of Twitter Corpora for Human and Social Sciences: Sampling Methodology and Evaluation

    The increasing popularity of electronic messages challenges social science researchers, particularly regarding data representativeness and quality. We propose and evaluate a methodology to create a corpus of Twitter data for a given population by sampling the targeted user population. In our case, the population consists of the groups of citizens, media and politicians using Twitter in Belgium (French and Dutch languages), Norway and France. In particular, we present a machine-learning-based methodology that enables the population sampling. We also present a methodology to evaluate the representativeness of our corpus compared to the full Twitter stream of the targeted population. https://osf.io/gse4k

    Named Entity Recognition for German Using Conditional Random Fields and Linguistic Resources

    This paper presents a Named Entity Recognition system for German based on Conditional Random Fields. The model also includes language-independent features and features computed from large-coverage lexical resources. Alongside the results themselves, we show that adding linguistic resources to a probabilistic model improves the results significantly.

    Covid-19 and social media: a diachronic discourse analysis for the modeling of linguistic patterns during crises

    In this presentation, we focus on the diachronic evolution of linguistic framings of the pandemic, from the emergence of the disease in China to reopening plans, through lockdown measures in Belgium. In particular, we try to identify formulas that act as disincentives to citizens' acceptance of political measures. Through the analysis of messages over time, we aim to find possible discourse/reaction curves that could model the linguistic pattern of the variation between incentives and disincentives across the different stages of the pandemic crisis.

    High Risk of Anal and Rectal Cancer in Patients With Anal and/or Perianal Crohn’s Disease

    Background & Aims: Little is known about the magnitude of the risk of anal and rectal cancer in patients with anal and/or perineal Crohn’s disease. We aimed to assess the risk of anal and rectal cancer in patients with Crohn’s perianal disease followed up in the Cancers Et Surrisque Associé aux Maladies Inflammatoires Intestinales En France (CESAME) cohort. Methods: We collected data from 19,486 patients with inflammatory bowel disease (IBD) enrolled in the observational CESAME study in France, from May 2004 through June 2005; 14.9% of participants had past or current anal and/or perianal Crohn’s disease. Subjects were followed up for a median time of 35 months (interquartile range, 29–40 mo). To identify risk factors for anal cancer in the total CESAME population, we performed a case-control study in which participants were matched for age and sex. Results: Among the total IBD population, 8 patients developed anal cancer and 14 patients developed rectal cancer. In the subgroup of 2911 patients with past or current anal and/or perianal Crohn’s lesions at cohort entry, 2 developed anal squamous-cell carcinoma, 3 developed perianal fistula–related adenocarcinoma, and 6 developed rectal cancer. The corresponding incidence rates were 0.26 per 1000 patient-years for anal squamous-cell carcinoma, 0.38 per 1000 patient-years for perianal fistula–related adenocarcinoma, and 0.77 per 1000 patient-years for rectal cancer. Among the 16,575 patients with ulcerative colitis or Crohn’s disease without anal or perianal lesions, the incidence rate of anal cancer was 0.08 per 1000 patient-years and of rectal cancer was 0.21 per 1000 patient-years. Among factors tested by univariate conditional regression (IBD subtype, disease duration, exposure to immune-suppressive therapy, presence of past or current anal and/or perianal lesions), the presence of past or current anal and/or perianal lesions at cohort entry was the only factor significantly associated with development of anal cancer (odds ratio, 11.2; 95% CI, 1.18-551.51; P = .03). Conclusions: In an analysis of data from the CESAME cohort in France, patients with anal and/or perianal Crohn’s disease have a high risk of anal cancer, including perianal fistula–related cancer, and a high risk of rectal cancer.

    Low incidence of SARS-CoV-2, risk factors of mortality and the course of illness in the French national cohort of dialysis patients

    The aim of this study was to estimate the incidence of COVID-19 disease in the French national population of dialysis patients, to describe their course of illness and to identify the risk factors associated with mortality. Our study included all patients on dialysis recorded in the French REIN Registry in April 2020. Clinical characteristics at last follow-up and the evolution of COVID-19 illness severity over time were recorded for diagnosed cases (either suspicious clinical symptoms, characteristic signs on the chest scan or a positive reverse transcription polymerase chain reaction for SARS-CoV-2). A total of 1,621 infected patients were reported on the REIN registry from March 16th, 2020 to May 4th, 2020. Of these, 344 died. The prevalence of COVID-19 patients varied from less than 1% to 10% between regions. The probability of being a case was higher in males, patients with diabetes, those in need of assistance for transfer or treated at a self-care unit. Dialysis at home was associated with a lower probability of being infected, as was being a smoker, a former smoker, having an active malignancy, or peripheral vascular disease. Mortality in diagnosed cases (21%) was associated with the same causes as in the general population. Higher age, hypoalbuminemia and the presence of an ischemic heart disease were statistically independently associated with a higher risk of death. Being treated at a self-care unit was associated with a lower risk. Thus, our study showed a relatively low frequency of COVID-19 among dialysis patients, contrary to what might have been assumed.
